List of AI News about GDPval benchmark
| Time | Details |
|---|---|
|
2025-12-11 18:27 |
GPT-5.2 Achieves 70% Expert Preference in GDPval Benchmark, Surpassing GPT-5 in Business Applications
According to Sam Altman, the GDPval benchmark measures how often industry experts prefer the output of an AI model compared to outputs from other experts. GPT-5.2 achieved a 70% preference rate, significantly higher than GPT-5's 38%. This advancement demonstrates the model's superior performance in generating slides, spreadsheets, code, and other business-critical content, suggesting increased business value and reliability for enterprise AI deployments (source: Sam Altman on Twitter, Dec 11, 2025). |
|
2025-09-25 19:52 |
OpenAI Launches GDPval: New Benchmark for Measuring and Forecasting AI Model Progress in Real-World Applications
According to Greg Brockman on Twitter, OpenAI has introduced GDPval, a novel benchmark designed to improve the measurement and forecasting of real-world AI model progress (source: x.com/OpenAI/status/1971249374077518226). GDPval aims to provide a more reliable and standardized framework for evaluating the practical impact and effectiveness of AI systems in real-world business and industry scenarios. This move addresses a critical gap in the AI industry, where existing benchmarks often fail to capture the nuances of real-world deployment. By enabling businesses and developers to better track and predict model advancements, GDPval presents significant opportunities for data-driven decision-making, AI investment strategies, and risk management in enterprise settings (source: OpenAI via Twitter, Sep 25, 2025). |